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Preface 


Goodness-of-f i t  tests  are  developed  for  the  Logistic 
distribution  when  the  location  and  scale  parameters  are  un¬ 
known.  The  Kolmogorov-Smi rnov ,  Anderson-Darling  and  Cramer- 
von  Mises  statistics  are  used  to  develop  tables  of  critical 
values  to  be  used  in  goodness-of-f i t  hypothesis  testing.  A 
power  study  is  conducted  to  compare  the  Kolmogorov-Smi rnov , 
Anderson-Darling  and  Cramer-von  Mises  goodness-of-f i t  tests. 
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Abstrac t 


The  method  of  maximum  likelihood  is  used  to  determine 
invariant  estimates  of  the  unknown  location  and  scale  para¬ 
meters  of  a  sample  from  the  Logistic  distribution.  The  par¬ 
tial  derivatives  of  the  likelihood  function  can  not  be  solved 
explicitly,  therefore  the  Secant  method  is  used  to  iteratively 
determine  the  roots  of  the  partial  derivatives.  Using  these 
estimates,  modified  Kolmogorov-Smi rnov ,  Anderson-Darling  and 
Cramer-von  Mises  statistics  are  calculated  for  a  given  sample. 
This  procedure  is  repeated  5000  times  for  sample  sizes  of 
n=  5<5>30.  The  80th,  85th,  90th,  95th  and  99th  percentiles  of 
the  distribution  of  each  statistic,  for  each  sample  size,  is 
then  calculated.  These  values  are  then  used  to  generate 

tables  of  critical  values  for  the  Logistic  distribution  with 

unknown  location  and  scale  parameters.  A  power  comparison 
between  the  three  tests  is  performed  using  samples  from  va¬ 
rious  distributions. 

The  Secant  method  requires  “good*  initial  estimates  of 
the  parameters  in  order  to  converge.  This  thesis  uses  the 
sample  mean  and  standard  deviation  as  initial  estimates.  In 
four  of  the  total  30,000  samples  used,  these  initial  estimates 
did  not  allow  convergence.  Ulh  i  1  e  discarding  these  samples 
biases  the  theoretical  results,  it  was  determined  that  discar¬ 
ding  these  samples  would  not  biases  the  numerical  results. 

This  does  however  place  a  constraint  on  using  the  Secant 


MODIFIED  KOLMOGOROV -SMI RNOV ,  ANDERSON- DARLING  AND 
CRAMER-VON  MISES  TESTS  FOR  THE  LOGISTIC  DISTRIBUTION 
WITH  UNKNOWN  LOCATION  AND  SCALE  PARAMETERS 

I.  INTRODUCTION 

The  Air  Force  is  highly  interested  in  the  reliability  and 
maintainability  of  proposed  systems.  When  no  historical  data 
exits,  statistical  and  probabilistic  measures  are  often  the 
only  approaches  possible  in  gathering  meaningful  information 
to  aid  the  decision  maker.  The  mean  time  to  failure  and  the 
failure  rate  of  a  proposed  system  are  usually  unknown  but 
important  considerations  in  the  decision  making  process. 

Various  methods  can  be  used  to  collect,  for  example,  time 
to  failure  data  from  a  experimental  or  prototype  system.  The 
data  can  then  be  compared  to  a  theoretical  probability  distri¬ 
bution.  How  well  the  distribution  of  the  experimental  data 
matchs  the  theoretical  distribution  is  known  as  a  "goodness- 
of-fit  test".  If  such  tests  show  that  the  distribution  of 
the  experimental  data  "fits"  the  theoretical  distribution 
well,  the  hypothesized  theoretical  distribution  can  be  used  in 
simulation  modeling,  for  example,  to  predict  the  failure  rate 
of  the  proposed  system.  The  Gamma  distribution  has  often  been 
used  in  such  studies,  and  its  hazard  function  approaches  a 
constant  value.  The  hazard  function  of  the  Logistic  distribu¬ 
tion  also  approaches  a  constant  value,  as  time  approachs 
infinity,  and  is  therefore  a  useful  alternative  in  some  relia¬ 
bility  and  life-testing  situations. 


BACKGROUND 


There  are  several  classical  goodness-of-f  i  t  tests,  such 
as  the  Chi-Square  test,  the  Kolmogorov-Smi rnov  test  (KS>,  the 
Anderson-Darling  test  <A^>  f  and  the  Cramer-yon  Mises  test  (Ul^)  . 

Of  these  tests,  the  two  most  popular  are  the  Chi-Square  and 
the  Kolmogorov-Sm i rnov  tests.  The  Chi-square  test  compares 
the  observed  frequencies  of  the  empirical  distribution  to  the 
expected  frequencies  of  the  hypothesized  distribution.  How¬ 
ever,  because  it  groups  observations  (with  a  minimum  of  five 
observations  per  cell)  it  is  restricted  to  larger  samples 
<2:73).  The  KS  test,  on  the  other  hand,  has  no  such  restric¬ 
tion.  This  test  uses  as  a  measure  of  fit  the  absolute  differ¬ 
ence  between  the  empirical  distribution  and  the  hypothesized 
distribution.  Because  of  this,  no  grouping  of  data  is  re¬ 
quired  and  smaller  samples  sizes  can  be  accommodated.  The 
classical  KS  goodness-of-f i t  test  is  valid  to  test  whether  a 
set  of  observations  comes  from  a  completely  specified  distribu¬ 
tion. 

However,  in  practical  applications  the  distribution  is 
seldom  fully  specified.  In  cases  where  the  parameters  must  be 
estimated  from  the  sample,  the  Chi-square  test  is  easily  adjusted 
by  reducing  the  number  of  degrees  of  freedom  by  the  number  of 
parameters  estimated.  The  KS  test  can  also  be  modified  to 
consider  the  case  where  the  parameters  are  estimated  from  the 
sample  data.  H.W.  Lilliefors  developed  a  modified  KS  goodness- 
of-f  it  test  for  the  Normal  distribution  (30)  in  1967  and  for 


the  Exponential  distribution  (31)  in  1969.  R.  Cortes  deve¬ 
loped  a  modified  KS  test  in  1980  (16)  to  be  used  with  the 

Gamma  and  Uleibull  distributions.  J.G.  Bush,  in  1981,  developed 
2  2 

modified  A  ancj  y  tests  to  be  used  with  the  UJeibull  distribu- 

2 

tion  (13).  In  1982  F.J.  Viviano  developed  modified  KS,  A 
2 

and  U1  tests  for  the  Gamma  distribution  (42).  The  KS  test, 

2  2 

as  well  as  the  A  and  U1  tests,  were  modified  for  the  Uniform, 
Normal,  Laplace,  Exponential  and  Cauchy  distributions  by  Green 
and  Hegazy  (18)  in  1976.  In  1981,  Koutrouvelis  and  Keller- 
me i er  developed  a  goodness-of-f i t  test  based  on  the  empirical 
charac ter i st i c  function  when  the  characteristic  function  is  a 
member  of  a  specified  parametric  family  of  such  functions 
<28) .  Masaak i  ,  Hiroshi  and  Shi  geo,  in  1980,  developed  a 
goodness-of-f i t  test  for  the  extreme  value  distribution  based 
on  the  entropy  of  the  sample  data  (32). 

EMPIRICAL  DISTRIBUTION  FUNCTION 

A  class  of  statistics  based  on  a  comparison  between  the 
theoretical  cumulative  distribution  function  F(x)  and  the 
sample  cumulative  distribution  function  S(x)  is  generally 
called  empirical  distribution  function  (EDF)  statistics.  His¬ 
torically,  EOF  statistics  are  used  in  cases  were  parameters 
are  estimated  from  the  sample  observations.  The  EDF  of  a 
random  sample  is  defined  as 


r  1,1  T 


In  general,  EDF  tests  are  valid  whenever  the  hypothesized 


distribution  is  fully  specified.  The  probability  integral 
transformation  converts  the  values  of  a  completely  specified 
cumulative  distribution  to  ordered  values  from  a  uniform  dis¬ 
tribution  over  the  interval  zero  to  one  <17:184).  However  in 
practice  the  distribution  is  seldom  fully  specified.  Hence, 
the  probability  integral  transform,  by  itself,  is  not  enough 
help.  David  and  Johnson  (17)  have  however  shown  that  when 
location  and  scale  are  the  parameters  being  estimated,  the 
cumulative  distribution  of  EDF  statistics  depends  on  the  func¬ 
tional  form  of  the  distribution  rather  than  the  estimated 
parameters.  This  allows  the  probability  integral  transform  to 
remain  valid  in  certain  cases  when  the  distribution  is  not 
completely  specified.  It  is  this  quality,  coupled  with  in¬ 
variant  estimates  of  the  parameters,  that  allows  the  gene¬ 
ration  of  valid  critical  value  tables  dependent  only  on  n  and 
the  significance  level  (a). 


r.v 
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THE  KOLMOGORO^-SM I RNOU  STATISTIC 

The  Kolmogorov-Smi rnov  statistic  is  defined  as  the  abso¬ 
lute  difference  between  F<x>  and  S<x).  Yet,  since  we  are 
interested  in  the  greatest  discrepancy  between  the  distribu¬ 
tions,  our  test  statistic  is 
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KS  =  sup  |  F<  x  >  -  S< x ) 


In  effect,  this  measures  the  superium  of  the  absolute  vertical 
discrepancy  between  the  hypothesized  distribution  and  the 
empirical  distribution  <15:346). 


THE  ANDERSON-DARLING  STATISTIC 


Goodness— of — f i t  tests  based  on  the  difference  between 
empirical  and  hypothesized  distributions  almost  always  have 
smaller  discrepancies  in  the  tails  of  the  distribution  <40>. 
To  overcome  this,  several  things  can  be  done.  The  most  popu¬ 
lar  is  to  weight  the  squared  differences  between  distribu- 
tions.  The  A  statistic  is  an  example  of  such  an  approach. 

It  is  based  on  a  nonnegative,  weighted  average  of  the  squared 
discrepancy.  That  is 


A2  =  n  [S<x>-F<x)]20CF<x)Idx 


where 


F< x )  I  =  t(F(x)Xl -F < x )  )  I 


'^-•Vvlvv'v 


In  computational  form 


-n  -  _  Z<2j-l>[ln 

n  j  =  1 


F<x  >+ln<l-F<x  .)] 
J  n+l-j 


In  effect,  this  accentuates  the  difference  between  F<x)  and 
S<x>  in  the  tails  of  the  distribution  <40>. 


THE  CRAMER-VON  MISES  STATISTIC 

Another  example  of  this  approach  is  the  Cramer-won  Mises 
statistic.  In  this  case  the  weighting  function  OCF(x>]  equals 
one.  This  defines  the  statistic  as 


u  =  J 


“  CS<x>-F<x)  ]2dx 
-00 


(7 


with  a  computational  form  of 


2  1  n  2 
W  =  _  +  Z  IF<x  .)  -  <  ( 2j -1  )/2n >  J  <8: 

1 2n  j=l  * 


In  effect,  this  equally  accentuates  the  differences  between 


F<  x )  and  S<x)  <40> 


V*  7'  ^  ^  •> 
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0 
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PROBLEM  STATEMENT 

Application  of  a  particular  distribution  to  a  problem 
often  is  hampered  in  one  of  two  ways.  Either  there  is  a  lack 
of  knowledge  about  the  parameters  of  a  specific  distribution 
or  there  is  not  a  method  to  easily  test  if  a  set  of  observa 
tions  are  a  random  sample  from  a  specific  distribution.  The 
former  problem  is  overcome  by  theoretical  investigations  of 
the  parameters  and  characteristics  of  the  distribution.  The 
latter  problem  is  overcome,  generally,  by  development  of  a 
general  test  statistic  or  a  table  of  critical  values  used  in 
goodness-of-f i t  hypothesis  testing  for  various  sample  sizes 
and  specific  parameters. 

Numerous  investigations  of  the  parameters  of  the  Logistic 
distribution  have  been  accomplished.  These  investigations  have 
used  various  methods.  For  example,  least  squares,  maximum 
likelihood  and  best  linear  unbiased  estimates  of  the  parame¬ 
ters  have  all  been  accomplished.  Based  on  asymptotic  distri¬ 
bution  theory,  Stephens  has  developed  critical  values  to  apply 
the  logistic  distribution  to  goodness-of-f i t  hypothesis  testing 
<41).  However,  his  test  statistics  are  highly  modified  for  the 
asymptotic  theory  to  hold  true.  No  known  effort  has  been 
done,  based  on  finite  distribution  theory,  to  easily  apply  the 
Logistic  distribution  to  goodness-of-f i t  hypothesis  testing. 
Because  of  its  applicability  in  reliability,  there  is  a  need 
to  develop  such  a  set  of  critical  values  tables  for  various 
sample  sizes  when  the  location  and  scale  parameters  are  esti- 
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mated  from  the  sample. 


OBJECTIVES 

This  thesis  has  the  following  objectives: 

1.  Based  on  finite  distribution  theory,  generate  and 
document  modified  Kolmogorov-Smi  mow ,  Anderson-Darling,  and 
Cramer-won  Mises  rejection  tables  for  the  two  parameter  Logis¬ 
tic  distribution  where  the  location  and  scale  parameters  are 
unknown . 

2.  Conduct  a  power  comparison  between  the  Kolmogorov- 
Smirnov,  Anderson-Darling  and  Cramer-won  Mises  goodness-of-f i t 


Although  the  name  might  imply  it,  the  logistic  distribu¬ 
tion  does  not  have  any  special  association  with  the  -fields  o-f 
supply  and  maintenance  or  the  -function  o-f  logistical  support. 
The  distribution  initially  was  used  in  a  study  o-f  population 
growth  by  Verhulst  in  1845.  In  1920,  Pearl  and  Reed  renewed 
interst  in  the  distribution  with  their  use  o-f  it  in  a  study  o-f 
population  growth  in  the  United  States  (33).  In  the  mid- 
1940's  Dr.  Berkson  began  using  the  distribution  with  respect 
to  studies  in  the  biological  sciences  (5).  He  established 
applications  -for  the  distribution  in  the  study  o-f  autocataly¬ 
sis,  electro-chemical  reactions  and  biochemistry.  Reed  and 
Berkson  used  the  logistic  distribution  in  phys i ochem i cal  phe¬ 
nomena  studies  in  1929  (36).  Bioassay  applications  were  est¬ 
ablished  by  Id i  Ison  and  Worchester  (43)  in  1943  and  by  Berkson 
between  1944  and  1957  (5,6,7).  In  1958,  Birnbaum  used  the 
distribution  in  a  study  o-f  mental  ability  (10).  Recently,  in 
1981,  Leach  (29)  used  the  logistic  distribution  in  a  study  on 
the  original  subject  o-f  population  growth. 

The  logistic  distribution  was  introduced  to  the  reliabi¬ 
lity  and  maintainability  -field  when  Plackett  used  it  in  a 
study  o-f  life-test  data  (34).  Shah,  in  1965,  showed  its 
applicability  in  psychometrics,  a  field  of  interest  to  the 
reliability  engineer  (39).  Bain  (4)  showed  that,  like  the 
gamma  distribution,  the  hazard  function  of  the  logistic  dis- 


tribution  approaches  a  constant  and  is  therefore  a  useful 
alternative  in  reliability  and  life  testing  situations. 

Much  work  has  also  been  done  with  respect  to  estimation 
of  the  parameters  of  the  logistic  distribution.  Pearl  and 
Reed  <33),  Schultz  <38)  and  Berkson  (5)  all  obtained  least 
square  estimates  of  the  parameters.  Maximum  likelihood  esti¬ 
mation  of  the  parameters  has  been  done  by  Ulilson  and  Ulorchester 
<43),  Berkson  <7),  and  Harter  and  Moore  <24>.  Berkson  and 
Hodges  developed  a  minimax  estimator  (8).  The  minimum  chi- 

square  technique  was  also  used  by  Berkson  to  estimate  the 
parameters.  Plackett  (34),  Kjelsberg  (27)  and  Gupta,  Qureishi 
and  Shah  <21)  have  all  developed  best  linear  unbiased  esti¬ 
mates  for  complete  and  censored  samples,  while  Plackett  deve¬ 
loped  linear  estimates  from  censored  data  (35).  Beyer  inves¬ 
tigated  the  conditional  estimation  of  the  scale  parameter 
using  selected  order  statistics  in  1 966  (?) .  Also  using 

selected  order  statistics,  Richardson  investigated  simulta¬ 
neous  linear  estimation  of  the  location  and  scale  parameters 
<  37)  . 

Exact  moments  of  the  order  statistics  of  the  logistic 
distribution  have  also  been  developed.  Birnbaum  (10),  Birnbaum 
and  Dudman  <  1 1 )  ,  Plackett  <34),  and  Gupta  and  Shah  (22)  have 
all  worked  on  the  exact  moments  of  the  distribution.  Shah  has 
tabled  variances  and  covariances  of  logistic  order  statistics 
for  sample  sizes  up  to  10  (39).  Gupta,  Qureishi  and  Shah 
extended  this  through  a  sample  size  of  25  (21).  Harter  and 
Moore  established  the  asymptotic  variances  and  covariances  of 


the  maximum  likelihood  estimators  in  1967  <24)  and  Bain  <4> 


presented  tabled  percentage  points  for  the  maximum  likelihood 
est imators . 


The  Logistic  Distribution 

The  shape  of  the  logistic  distribution  is  convex  and 
symmetric  about  the  mean  and  similar  to  that  of  the  normal 
distribution.  The  notable  difference  is  that  the  tails  are 
relatively  thick,  more  like  that  of  the  exponential  distribu¬ 
tion.  The  location  parameter  or  mean  (l1)  relates  the  point 
of  symmetry,  the  median  and  the  mode  of  the  logistic  density 
function.  The  standard  deviation  <<T>  measures  the  relative 
dispersion  of  the  distribution  along  the  axis  of  the  indepen¬ 
dent  variable. 

The  logistic  distribution  with  mean  F  and  standard 
deviation  <T  is  defined  as 

F< x )=  I  l+expt-K<x-P)/cr433  3'1  (9 


with  the  scale  parameter  defined  as  <t/3/K. 
The  density  function  is  expressed  as 


Hexp  [  -<<  x-P  )/<T-i33 
cr-/3[l+expt-K(x-P)/<7j33  3  2 


( 10‘ 


The  location  parameter  F  indicates  the  value  of  x  at  which 
failures  are  most  likely  to  begin  occuring. 


The  moments  of  the  logistic  distribution  can  be 

us  i  ng 


f  *  x^ltexpt  -*<  x-F  )/<t43] 

J-oo 

c<r-/3t  l  +  expC-KCx-«i>/<r43]]'2]  "*dx 


or  more  easily  by  the  moment  generating  function  (mgf) 


E(explxt3)=  f°°  expCxt]KexpC-K<x-l*) /<?43 ] 

■CD 

t<r43t  l+expt-K<x-l»)/<r-/3]  ]*2]_1dx 

From  the  reduced  variant,  y=  <x-F>/<T,  Gumbel  <?0)  has 
shown  that  the  mgf  can  be  expressed  as 


found 


(11) 


(12) 


(  13> 


m  < t>=  r< i+t/c>r< l-  t/c> 

x 

which  is  a  Beta  -function  were  c=  It  /43. 


APPLICATION  OF  THE  LOGISTIC  DENSITY  FUNCTION 

I-f  a  random  variable  represents  the  lifetime  or  time  to 
-failure  of  a  unit,  then  the  study  of  that  variable  is  said  to 
be  in  the  area  of  life-testing  or  reliability  theory.  The 
probability  that  a  unit  survives  until  time  x  is  called  the 
reliability  of  that  unit  at  time  x  and  denoted  as  R(x>  =  l-F(x). 
On  the  other  hand,  the  hazard  function  of  the  unit  may  be 

interpreted  as  the  instantaneous  failure  rate  of  the  unit 

<4:42).  It  is  often  more  informative  to  consider  the  hazard 

function  of  a  model  than  to  look  at  the  shape  of  the  pdf  or 

cdf  directly.  A  typical  hazard  function  in  the  area  of  life 
testing  is  a  U-shaped  or  bathtub  shaped  curve  (Fig  1). 
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Typical  hazard  function 
Figure  1 

Normally,  when  a  unit  is  placed  in  service  it  first  goes 
through  a  per  lod  where  the  frequency  of  failures  is  decrea¬ 
sing.  That  is,  as  manufacturing  defects  are  overcome,  the 
reliability  of  the  unit  improves  with  age.  The  unit  then  goes 
through  a  period  where  fai lures  are  more  or  less  random  at  a 
constant  rate.  As  the  unit  begins  to  wear  out  or  deteriorate 
the  failures  become  more  frequent  and  the  failure  rate  in¬ 
creases. 

The  hazard  function  for  the  logistic  distribution  is 


expressed  as 


f<x>  KF  ( x ) 

h  <  x )  = _  =  _ 

1  -  F(x)  <t43 


<  1- 


h(x)  is  an  increasing  -function  o-f  x  and  it  is  easy  to  see  that 
h(x)  approachs  X/<?43  as  x-*».  This  property  may  be  well  suited 
to  the  analysis  of  certain  systems.  For  example,  the  hazard 
function  of  a  system  may  not  be  characterized  by  the  normal 
bath-tub  curve.  When  placed  in  operation,  the  system  may 
almost  immediately  begin  to  wear  out  with  an  increasing  fail¬ 
ure  rate.  The  failure  rate  may  than  approach  a  constant,  and 
continue  thus  for  quite  some  time.  In  situations  such  as  this 


the  logistic  function  can  be  used  to  advantage 


METHODOLOGY 


This  thesis  presents  tabled  critical  values  of  the  modi¬ 
fied  Kol mogorov-Sm i rnov  ,  Anderson-Darling  and  Cramer-von  Mises 
goodness-of  — f  i t  tests  for  the  logistic  distribution  with  un¬ 
known  location  and  scale  parameters.  This  chapter  discusses 
the  procedures  used  in  this  thesis.  First,  an  outline  and 
flow  chart  of  the  Monte  Carlo  method  is  presented.  Next, 
there  is  a  description  of  the  steps  followed  in  this  method. 
Finally,  the  chapter  concludes  with  a  discussion  of  the  power 
study  between  these  test  statistics. 


STEPS  IN  THE  MONTE  CARLO  METHOD 

The  following  eight  steps  outline  the  Monte  Carlo  method 

used  to  calculate  the  critical  values  for  the  modified  KS, 
2  2 

”  and  W  goodness-of-f i t  tests.  The  flow  chart  in  Figure  2 
illustrates  this  method. 

1.  For  a  fixed  sample  size  n,  random  deviates  from  the 
logistic  distribution  are  generated  using  a  computer  subrou¬ 
tine. 

2.  Estimates  of  the  location  and  scale  parameters  are 
calculated  using  the  method  of  maximum  likelihood.  The 
MLE  estimates  are  calculated  iteratively  using  the  Secant 
method . 

3.  The  random  deviates  obtained  in  step  one  are  arranged 


in  ascending  order  using  a  computer  subroutine. 


Gtnerat*  random 

d*w i *t** 


4.  The  estimated  parameters  are  used  to  calculate  the 
hypothesized  distribution  F(x). 

5.  The  KS  statistic  is  calculated  using  Eq  <4>,  the 

2  2 
A  statistic  using  Eq  <6>  and  the  W  using  Eq  <8>. 

6.  The  above  steps  are  repeated  5000  times.  As  a  result 
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5000  independent  KS,  A  and  LJ  statistics  are  generated. 

7.  Each  group  of  5000  statistics  are  ordered.  Using 

plotting  positions,  discussed  later,  the  80th,  85th,  90th, 

95th,  and  99th  percentiles  o-f  the  distribution  o-f  each  statis¬ 
tic  are  calculated  by  linear  interpolation. 

8.  Steps  1  to  7  are  repeated  -for  samples  sizes,  equal  to 
5,  10,  15,  20,  25  and  30. 


Generation  o-f  random  logistic  deviates 

There  is  no  known  available  computer  routine  to  generate 
random  deviates  from  the  logistic  distribution.  However,  the 
probabl i ty  integral  transform  insures  that  a  random  variable 
from  any  distribution  can  be  transformed  to  a  random  variable 
distributed  uniformly  on  the  interval  zero  to  one.  Further, 
this  transformation  is  independent  of  the  location  and  scale 
parameters  of  the  distribution  (17).  Because  of  this,  the 
concept  can  be  reversed  to  generate  random  deviates  for  the 
logistic  distribution  by  transforming  random  deviates  distri¬ 
buted  uniformly  on  the  interval  zero  to  one.  That  is,  if  RN 


is  a  pseudo-random  deviate  distributed  uniformly  (0,1 >,  it  can 


RN=  I  l+exp[-lt(x-»‘)/<r-i33]'1 


(15) 


and  solving  -for  x  yields 


x=  M  -  [0"-/3(lnC(l  -RN )  /RN  ]  )  ]  /<  (16) 

However,  this  still  leaves  the  problem  o-f  generating 
pseudo-random  deviates  distributed  uniformly  (0,1).  The  ran¬ 
dom  uniform  deviates  for  this  thesis  are  obtained  on  the 
Control  Data  Corporation  <CDC>  6600  computer  using  the  Inter¬ 
national  Mathematical  and  Statistics  Library  (IMSL)  subroutine 
GGUBFS  <25:Ch  G) . 


Maximum  likelihood  estimates  of  the  logistic  parameters 

A  procedure  to  derive  the  maximum  likelihood  estimates 
(MLE)  for  location  and  scale  (^  and  ^  0*  th#  ,ogjstic  dis_ 

tribution  was  developed  by  Harter  and  Moore  (23).  This  proce¬ 
dure  iteratively  solved  the  first  partial  derivatives  of  the 
likelihood  function  after  initial  estimates  have  been  chosen. 


Iterative  linear  interpolation  was  applied  at  each  step  until 
the  results  of  successive  steps  agree  to  within  some  assigned 
tolerance.  A  procedure  similar  to  this  is  used  in  this  thesis 
The  maximum  likelihood  method  of  estimation  is  one  of  the 
most  commonly  used  methods  of  parameter  estimation  when  using 
EDF  statistics.  Some  of  the  reasons  for  this  are  that  MLE's 
are  consistent  and,  more  importantly,  they  are  invariant 
<24:239).  The  method  of  maximum  likelihood,  in  essence,  se¬ 
lects  as  estimates  of  the  unknown  parameters  those  values  for 
which  the  observed  sample  would  have  most  likely  occured 
<3:83).  That  is,  this  method  selects  as  estimates  those 
values  that  maximize  the  probability  of  the  occurance  of  the 
sample  results  <24:234). 

The  likelihood  function  is  defined  in  the  following  way, 

j  X  w 

i,x  ,...x  are  sample  observations  on  X,  then  the  likeli- 
2  n 

hood  function  <L)  is  defined  to  be  the  joint  density  function 

evaluated  at  x  ,,x  .  ...x  .  The  likelihood  function  can  be  repre- 
1  2  n 

sented  by 


n 


L=TT"f <x,.p  <r) 

i=l  ' 

* 


<  1 


The  procedure  to  determine  the  MLEs  and  <T^  is: 

1.  Take  the  partial  derivatives  of  L  with  respect  to 


each  parameter. 


2.  Set  these  equations  equal  to  zero  and  solve  simultan¬ 
eously  -for  the  values  of  the  parameter  estimates. 

Quite  often  this  procedure  is  easier  to  implement  if  the 
natural  logarithms  of  L  are  taken  first.  Doing  this  and  using 
Eqs  <10>  and  <1 7)  the  likelihood  function  for  the  logistic 
distribution  can  be  written  as 

n^  itexp  t  -<<  x  ,-P  )/<j,433 

i  =  l  <r4"3<  1 +  exp [ -<< x  -p >  * 


and  if  z  =  <<x  -P )/<r^3  then 
i  i 


<  1?> 


but  since  there  are  n!  permutations  of  the  realizations  on  the 
random  variables  we  have 


L= 


i) 


n  expt-z  ] 

TT  - i__2 

i  =  1  (1+expE-z.]) 


(20) 


The  natural  logarithm  of  L  is  then 


ln<L)’ 


n  n 

1  n< n  !  ) ♦nl n<1t/<r/§>-  Z  *  +2  £  1  n<  1/1  +exp£ -z  1>  <2I> 

i=l  i-1  ■ 


Taking  the  partial  derivatives  of  ln<L)  with  rttptd  to  9 
and  O  yitlds 


d  I n < L) 


x 

<j4 3 


•xpt-z. J 
n  -2£ _ L_ 


1+expC-z  ] 


(  22) 


ain(L) 

dv 


l 

<r 


Zz  -2Z 


z.exp<~z  ) 

i  _ i 

l+exp£-z.J 


-n 


<23) 


These  aquations,  whan  sat  aqual  to  zaro,  can  not  ba  sol  wad 
explicitly.  Howawar ,  tha  roots  of  thasa  aquations  can  ba 
found  using  itaratiwa  methods  on  a  computer. 


‘.A 
A 


Tha  Sacant  Mathod 

Tha  Sacant  mathod  for  finding  a  root  of  f(x>=0  is  a 
slight  variation  of  Newton's  mathod  (12s39>.  Therefore,  a 
brief  discussion  of  Newton's  mathod  will  pracaad  that  of  tha 
8ecant  mathod. 
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Suppose  that  the  -function  -f  is  twice  continuously  di-f-fer 


entiable  on  the  interval  Ca,b3.  Let  x'  be  an  element  of  t h . 
set  of  real  numbers  bracketed  by  Ca,bl.  Then  x'  is  an  approx 
imation  of  the  root  p  of  the  equation  f(x)=0  if  f'<x'>  is 
not  equal  to  zero  and  if  the  absolute  difference  between  x 
and  p  is  “small*.  Consider  the  second-degree  Taylor  polyno 
mial  for  f(x)  expanded  about  x ' 


f  <x>  =  f <xy )  +  <x-x/ )f ' (x' >  +  C <x-x"  >^/23f ' " <P<x) ) 


where  £(x)  lies  between  x  and  x' 


Since  f<p)=0  if  x=p  Eq .  <24)  becomes 

0  =  f  <x"  )  +  <p-x"  >f  ‘,<x")  +  C<p-x',>2/21f  "(P<x)  >  C 

2 

Since  Ip-x'!  is  assumed  to  be  small,  (p-x'>  is  smaller. 

2 

If  (p-x')  is  considered  negligible  then  the  third  term  in 
Eq  (25)  can  be  dropped.  Solving  for  p  then  yields  approxima- 


However,  with  respect  to  the  logistic  function,  we  want  to 
determine  the  roots  of  the  first  derivative  instead  of  the 
likelihood  function  itself.  That  is,  Eq  (26)  becomes 


p  =  x'-lL'(x')/L"(x')1 


(27) 


and  this  would  require  iteratively  evaluating  both  the  first 


and  second  derivatives  of  the  likelihood  function.  To  circum¬ 
vent  the  cumbersome  calculations  of  the  second  derivative  the 
Secant  method  can  be  used.  It  is  a  variation  of  Newton's 


method  and  derived  as  follows! 


By  definition 


*'<P  ,) 
n-1 


(28) 


if  x-p  then 
n-2 


f'<p  )  = 
n-1 


f(p  )-f (p  > 
n-2  n-1 


Pn-2-pn-l 


(2?) 


and  Newton's  -formula  becomes 


P  = 
« 


*<P 

>Tp" 


B-i 


n-i 


><Pn-rP 


n-2 

> 


> 


(3 


or  in  the  case  o-f  the  likelihood  -function  -for  the  logistic 
distribution 


p  = 

n 


ft-1 


L'  <  p  )<p  -p  > 
ft-1  n-1  Kn  -2 

L'<p  ) -L' <  p  ) 
n-1  n-2 


(3 


thus  eliminating  the  need  -for  second  derivatives. 

One  requirement  o-f  both  Newton's  and  the  Secant  method  is 
that  a  good  initial  approximation  be  chosen.  Otherwise  the 
method  may  diverge.  In  this  thesis  the  sample  mean  and  stan¬ 
dard  deviation  will  be  used  as  initial  approximations. 


Orderino  the  Deviates 

The  random  logistic  deviates  are  arranged  in  ascending 
order  using  the  IMSL  subroutine  VSRTA  (25:Ch  V). 


The  Hypothesized  Distribution  -function 

The  location  and  standard  deviation  maximum  likelihood 
estimates  (^  and  and  ^e  n  ordered  logistic  deviates  (x 
are  used  to  calculate  the  hypothesized  distribution  -function 
F( x )  by 


F(x.)=  l/<  l+expC-lt(x.-PE)/<rEJ3) 


<3 


Using  the  hypothesized  and  sample  distributions  the  KS,  A 
2 

and  U  statistics  are  calculated.  This  is  done  5000 
times,  once  -for  each  o-f  the  5000  samples  of  size  n. 


Determining  the  critical  values  o-f  the  Qoodness-of-f i t  tests 

The  5000  statistics  -for  each  of  the  three  tests,  for  each 
sample  size,  are  ordered  before  determining  the  critical  va¬ 
lues.  Using  plotting  positions  and  linear  interpolation  the 
80th,  85th,  90th,  95th,  and  99th  percentiles  of  the  distribu¬ 

tion  of  each  test  statistic  are  found.  These  percentiles  are 
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the  critical  values  for  the  KS,  A  and  W  goodness-of-f i t  tests 
Given  a  series  of  ordered  values,  the  plotting  position 
of  each  event  is  its  cumulative  probability  (22:5).  There¬ 
fore,  the  use  of  plotting  positions  to  determine  critical 


values  requiress  a  careful  enumeration  of  the  cumulative  prob- 


abilities  on  the  ordinate  axis 


This  enumeration  is  compli- 


pyv  jv-. 

i 

e 


p-> 


ft- 


*.  ■ 
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cated  by  the  lack  of  probab i 1 i s t i c  values  for  the  end  points 
of  the  order  statistics.  For  example,  it  can  not  be  said 
that  the  first  order  statistic  occurs  with  probability  zero. 
Likewise,  the  probability  of  the  last  order  statistic  is  not 
one  since  one  more  real ization  of  the  random  variable  may 
yield  a  higher  valued  statistic.  Because  of  this,  it  is 
necessary  to  consider  the  ordinate  value  of  each  statistic  as 
a  function  of  its  relative  position. 

Three  plotting  positions  for  each  point  are  considered. 
These  are  the  middle  of  the  interval  between  the  i/nth  and 
<i-l)/nth  points,  the  median  ranks  plotting  position  and  the 
mean  position.  The  first  plotting  position  can  be  expressed 
as 


< i - . 5>/n 


<3 


This  is  the  value  midway  through  the  jump  from  < i -1 )/n  to  i -n 
and  was  first  proposed  by  Hazen  in  1914  (22:1).  The  second 
plotting  position  can  be  very  closely  approximated  by 


( : 


( i -0 .3)  /  <n-0 .4) 


£ 


This  value  is  essentially  the  median  value  of  the  distribution 
of  the  i th  order  statistic  and  was  proposed  by  Johnson  in  1951 
<22:1).  The  third  plotting  position  is  the  mean  position  and 
i s  expressed  as 

i / ( n  + 1 >  <  35) 

t 

This  value  is  the  expected  value  of  the  cdf  population  at  the 
i th  order  statistic.  It  was  proposed  by  We i bu 1 1  in  1939 
<22:1). 
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The  KS,  A  ancj  ^  ordered  statistics  form  the  basis 
of  the  abscissa  axis.  The  associated  ordered  plotting  posi¬ 
tions  form  the  basis  of  the  ordinate  axis.  The  last  entry  is 

added  to  these  ordered  values  making  up  the  5001  valves  on  the 
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abscissa  and  ordinate  axes.  The  5000  ordered  KS  <A  or  W  ) 
statistics  form  the  1st  to  5000th  positions  on  the  abscissa 
axis.  The  last  position  is  calculated  by  linearly  extrapo¬ 
lating  from  the  4999th  and  5000th  entries.  The  calculation  of 
the  5001st  entry  is  not  subject  to  a  maximum  value.  The  5000 
ordered  plotting  positions  form  the  1st  to  5000th  positions  on 
the  ordinate  axis.  The  interval  is  completed  by  entering  a 
one  in  the  5001st  position.  The  addition  of  these  "extra" 
values  on  each  axis  allows  an  easier  determination  of  the  i th 
and  <i-l)th  points. 

The  critical  values  are  calculated  by  linear  interpola¬ 
tion  between  the  ordered  statistics  and  the  corresponding 


plotting  positions.  For  example,  the  plotting  position  just 
larger  and  just  smaller  than  .90  are  determined.  These  become 
the  i  th  and  (i-l)th  points.  The  ratio  of  the  difference 
between  the  desired  percentage  point  and  the  (i-l)st  point  is 
then  calculated.  This  ratio  is  then  applied  to  the  difference 
between  the  ith  and  <i-l)th  test  statistics  to  yield  the 
desired  percentile.  Figure  3  shows  this  procedure  graphical¬ 
ly.  At  5000  repeitions  there  is  no  difference  in  the  third 
significant  digit  in  calculating  the  KS,  A  or  statistics 

for  the  three  plotting  positions.  For  simplicity,  the  plot¬ 
ting  position  described  by  Eq  (33)  is  used. 


Power  comparison 


In  this  thesis  a  comparison  of  the  power  of  the  modified 
2  2 

KS,  A  and  W  tests  is  made.  The  power  of  each  test  is  com¬ 
pared  -for  several  different  distributions.  The  hypothesis 
being  tested  is 

Hgj  the  sample  values  follow  a  logistic  distribution. 

H  :  the  sample  values  do  not  follow  a  logistic  distribution 
A 

Using  IMSL  subroutines  on  the  CDC  6600,  random  deviates  from 

different  distributions  of  sample  size  n  are  generated.  Th 
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test  statistics  KS,  A  and  W  are  calculated  under  the  null  hy¬ 
pothesis.  These  test  statistics  are  then  compared  to  the 
respective  critical  values  developed  in  this  thesis.  This 
procedure  is  repeated  1000  times  for  each  sample  size.  The 
number  of  times  the  test  statistic  exceeds  the  critical  value 
is  counted.  Exceeding  the  critical  value  results  in  a  rejec¬ 
tion  of  H  The  power  of  the  test  for  a  given  sample  size 
is  the  number  of  rejections  divided  by  the  total  number  of 
tests,  1000. 

The  different  distributions  considered  in  this  power 
study  are: 

1 .  Uni  form 


2.  Exponential 

3.  We i bu 1 1 < shape=3) 

4.  Gamma  <shape=3> 


5.  Gamma  <shape=6> 

6.  Gamma  (shape®?) 

7.  Gamma  (shape=15) 

8.  Gamma  (shape=30) 

9.  Normal 

1 0 .  Log i st i c 

The  g^.T<na  distribution  is  o-ften  used  in  reliability  and 
maintainability  theory.  Its  density  -function  is  expressed  as 


x*  *exp£  -x ] 

f  <x>  = _  (36) 

TOO 

where  K  =  shape  parameter 

A  graph  of  the  gamma  distribution  for  various  values  of  K  is 
shown  in  Figures  4a  and  4b. 
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The  We  i  bu  11  distribution  is  also  o-ften  used  in  conjunc¬ 
tion  with  reliability  and  maintainability  studies.  Its  density 
■function  is  expressed  as 


f  <  x )  Kx**  *expt  -x*S 


(37) 


where  K  holds  the  same  notation  as  in  Eq  (27).  A  graph  o-f 
the  Ueibull  distribution  is  shown  in  Figure  5. 

As  can  be  seen,  the  gamma  and  we i bull  distributions  are 
generally  symmetrically  convex  -for  the  correct  parameter  values 
The  normal  and  exponential  distributions  are  well  known  and  nee 
not  be  discussed  here. 


Computer  proorams 

Computer  programs  used  in  this  thesis  are  presented  in 


Appendix  E. 


IV.  USE  OF  TABLES 


This  chapter  discusses  the  use  of  the  tabled  critical 
values  of  the  two  parameter  Logistic  distribution  for  the 
modified  KS,  A^  ancj  y*  statistics  generated  in  this  thesis. 

An  example  follows,  with  an  explanation  of  the  basic  procedure 
to  utilize  the  table. 

For  all  three  goodness-of-f i t  tests,  a  theoretical  dis¬ 
tribution  F<x)  is  compared  to  an  empirical,  observed  distribu- 
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t i on  S<x).  The  KS,  A  ,  or  U  statistic  is  calculated  using  the 
appropriate  equation  <Eq  <4>,  Eq  C 6)  or  Eq  (8)).  If  the  value 
of  the  statistic  exceeds  the  tabled  critical  value,  the  theore¬ 
tical  distribution  is  rejected.  The  steps  in  applying  this 
procedure  ares 

1.  Determine  the  sample  size  n  and  the  level  of  signi¬ 
ficance.  The  significance  level  is  the  probability  of  rejec¬ 
ting  the  null  hypothesis  that  the  sample  is  from  the  Logistic 
distribution  when  the  null  hypothesis  is  true. 

2.  Select,  in  a  random  manner,  the  n  observations  from 
the  population  to  be  tested  and  order  them  from  smallest  to 
1 argest , 

3.  Estimate  the  unknown  location  and  scale  parameters 
from  the  observed  sample  using  the  method  of  maximum  likeli¬ 
hood. 


4.  Completely  specify  the  hypothesized  distribution  F<x> 


using  the  estimated  location  and  scale  parameters.  Determine 
the  values  of  the  empirical  distribution  just  prior  to  and 
after  each  o-f  the  1/n  jumps. 
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5.  Determine  the  value  o-f  the  KS,  A  (  or  W  test  statistics 
by  using  Eq  (4),  Eq  (6)  and  Eq  <8>. 

6.  Find  the  intersection  o-f  the  s  i  gn  i -f  i  cance  column  and 
the  sample  size  row  in  the  table  of  values.  This  is  the 
critical  value  to  be  compared  to  the  test  statistic. 

7.  Reject  the  null  hypothesis  if  the  value  of  the  test 
statistic  exceeds  the  critical  value.  If  the  test  statistic 
does  not  exceed  the  critical  value,  we  fail  to  reject  the  null 
hypothesis  and  conclude  there  is  insufficient  evidence  to  say 
the  observed  sample  does  not  follow  the  logistic  distribution. 

EXAMPLE 

The  following  example  illustrates  this  procedure  for  the 
modified  Kol mogor ov -Sm i r nov  test.  For  a  sample  size  of  five 
the  following  numbers  are  obtained:  104.9829,  81.1517, 

87.2204,  113.5512,  and  61.5415.  For  this  sample,  The  maximum 

likelihood  estimate  subroutine  used  in  this  thesis  yields 
location  and  standard  deviation  parameter  estimates  of:  - 

90.0986  and  <7^  =  20.1107.  The  scale  parameter  is  calculated 
using  <T^43/t ,  yielding  a  value  of  1  1.0876.  Using  these 
values  the  hypothesized  distribution  is  calculated  for  each 
sample  value  using  Eq  <9).  The  significance  level  is  .05. 
The  hypothesis  tested  is: 


F(x)  I  S(  x  >  S(x  +  >  I  KS 


-071  0  .2  .129 

•309  .2  .4  .109 

•  436  .4  .6  .164 


.793  .6  .8  .193 

.892  .8  1.0  .108 


S  i  nee 


is  .  30? . 
the  nu 1 1 


Since  .193  is  less  than  .309,  we  -fail 
hypothesis  that  the  sample  comes  -from 


to  reject 
a  Logist 


d  i  str  i  bu  t  i  on  . 


Kf.  DISCUSSION  OF  RESULTS 


This  chapter  discusses  the  results  obtained  in  this  the¬ 
sis  as  they  pertain  to  the  objectives  set  -forth  in  Chapter  I. 


PRESENTATION  OF  THE  TABLE  OF  CRITICAL  VALUES 

The  tabled  critical  values  (-for  sample  sizes  o-f  5,  10, 
15,  20,  25,  and  30)  -for  the  modi-fied  KS,  A^  and  w2  g0odness- 

o-f — -f  i  t  tests  are  presented  in  Appendancies  A,  B  and  C  respec¬ 
tively.  Each  table  is  valid  -for  the  Logistic  distribution 
when  the  location  and  scale  parameters  are  estimated  -from  the 
sample  using  the  method  o-f  maximum  likelihood  presented  in 
this  thesis. 

However,  the  Secant  method  used  in  this  thesis  has  cer¬ 
tain  properties  that  require  exp  1  a i nat i on .  As  stated  earlier, 
the  Secant  method  converges  only  when  the  initial  estimates  of 
the  parameters  are  "good*.  In  this  thesis  the  sample  mean  and 
standard  deviation  are  used  as  initial  estimates  of  the 
unknown  parameters.  In  all  but  four  of  the  30,000  samples 
used  in  this  thesis,  the  sample  mean  and  standard  deviation 
allowed  convergence  using  the  Secant  method.  The  four  cases 
of  non-convergence  occured  when  the  sample  was  very  tightly 
grouped,  thus  yielding  a  very  small  sample  standard  deviation. 
This  occured  two  times  in  a  sample  size  of  five  and  once  each 


r. 

M 


in  the  sample  sizes  of  10  and  15.  Discarding  these  non-converg- 
ent  samples  and  obtaining  a  new  sample  theoretically  biases 
the  resulting  cal  cu  1  at  i  ons .  However,  it  was  -felt  that  such  a 
small  number  o-f  samples  could  be  discarded  without  meaningfully 
biasing  the  numer i cal  results  of  this  thesis. 

This  conclusion  is  reached  for  the  following  reason. 

Discarding  samples,  within  a  single  sample  size,  changes  the 
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relative  positions  of  some  of  the  KS,  A  ancj  y  statistics  cal¬ 
culated  for  that  sample  size.  That  is,  if  five  samples  are 
taken  and  five  KS  statistics  obtained,  they  will  have  a  speci¬ 
fic  ascending  order.  However,  if  the  third  sample  is  discard¬ 
ed  and  a  “new"  sample  and  KS  statistic  obtained,  the  order  of 
some  of  the  five  KS  statistics  might  be  changed.  The  "new"  KS 
statistic  may  not  fit  in  the  third  ordered  position. 

The  most  severe  case  of  discarding  samples  in  this  thesis 
is  two  out  of  5,000.  This  occured  with  a  sample  size  of 
five.  This  effectively  means  that  any  calculated  statistic 
would  be,  at  most,  two  "positions"  out  of  order .  The  statis¬ 
tics  calculated  in  this  thesis,  when  ordered,  general y 
increment  in  the  the  fourth  digit.  Additionaly,  a  statistical 
value  two  positions  out  of  order  would  cause  an  error  of 
±.04X.  That  is,  for  5000  repetitions,  if  a  statistical  value 
is  two  positions  out  of  order,  then  a  critical  value  based 
on  this  ordering  is  at  most  ±.04’/.  in  error.  For  sample 

sizes  of  10  and  15,  the  critical  values  reported  would  be  in 
error  by  at  most  ±.02’/..  It  seems  unlikely  that  any  change 
in  the  three  significant  digits  reported  occured  due  to  the 
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pol icy  chosen . 

However,  this  property  of  non-convergence  in  some  cases 
represents  a  constraint  on  using  the  Secant  method  when  -finding 
the  maximum  likelihood  estimates  o-f  the  unknown  parameters. 

It  is  not  unreasonable  to  expect  some  cases  o-f  live  data  to 
truely  have  a  small  standard  deviation.  The  program  used  -for 
this  thesis  to  estimate  the  unknown  parameters  o-f  the  Logistic 
distribution  may  therefore  be  inappropriate  in  some  cases.  A 
gradient  search  method,  such  as  that  presented  by  Wingo 
<44:91),  may  overcome  this  problem. 

ANALYSIS  OF  CRITICAL  VALUE  TABLES 

Analysis  of  the  tabled  critical  values  generated  by  this 
thesis  and  the  distributions  of  the  KS,  A2  anc*  y2  statistics 
reveal  the  following  results. 

In  the  case  of  the  KS  tabled  critical  values,  analysis 
relative  to  the  significance  levels  and  sample  size  reveal 
that  a  concave  pattern  is  relatively  constant.  The  change  in 
significance  level  simply  shifts  this  curve  upward  <Fig  6). 


Analysis  of  the  distributions  of  the  KS ,  A^  ancj 
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W  statistics  di-»el  oped  in  this  thesis  reveal  the  following 
result.  For  all  samp le  sizes,  the  distri button  of  all  three 
statistics  approximately  follows  a  Giomper  tz  curve  (Fig  9). 
For  each  statistic,  this  curve  is  slightly  mod i f i ed  by  samp  1 e 


L»ALIDATION  OF  COMPUTER  PROGRAM 

The  computer  program  used  to  generate  the  critical  values 
obtained  in  this  thesis  is  validated  by  comparing  the  criti¬ 
cal  values  obtained  in  this  thesis  to  those  obtained  by  M.A. 
Stephens  and  by  hand  calculations.  The  hand  calculations 
validate  that  the  computer  program  generates  numbers  correct¬ 
ly,  while  the  comparison  to  Stephens  values  validates  that 
the  values  of  this  thesis  are  acceptable.  In  1979  Stephens 
calculated  modified  KS,  A2  and  y2  statistics  -for  the  Logistic 

distribution  based  on  asymptotic  distribution  theory.  This 
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thesis  calculated  KS,  A  and  W  statistics  based  on  finite 
distribution  theory.  If  the  two  sets  of  statistics  are  compar¬ 
able  this  implies  the  computer  program  generates  valid  results. 

This  comparison  was  done  in  the  following  manner.  The 
critical  values  obtained  in  this  thesis  are  modified  according 
to  the  appropriate  formula  presented  in  Stephens  paper 
<14:14,16).  The  result  is  then  compared  to  the  tabled  values 

presented  by  Stephens.  For  example,  with  a  sample  size  of  20 

2 

and  an  alpha  level  of  .05,  the  A  statistic  obtained  in 
this  thesis  is  .655.  This  value  was  then  modified  by  equa¬ 
tion  38. 


A 


2 


< 1  .0+<0 .25/n> ) 


(38) 


The  result  (.663)  is  then  compared  to  0.660,  the  value  presented 


by  Stephens. 


Such  comparisons  -for  all  values  obtained  in  this 


Table  III 


Anderson-Darling  <A  >  Comparison 


A  < 1  +  . 25/n  > 
n 

10  2 


Stephens 
Cr i t i cal 
values 


,570  .562  .564 
,651  .656  .663 
,891  .874  .887 


.563 

.660 

.906 


Table  IV 

2 

Cr  amer-Oon  Mises  (U)  >  Comparison 

1  -a 

<nW2-0 .08>/( n-1 ) 
n 

5  10  20 

Stephens 

Cr i t i cal 
values 

.90 

.081  .081  .081 

.081 

.95 

.099  .097  .096 

.098 

.99 

.135  .129  .133 

.136 

An  assumption  requiring  validation  is  with  respect  to  the 
MLE  estimates  generated.  Maximum  likelihood  theory  specifies  that 
MLE  estimates  are  invariant.  The  invariant  property  is  critical 
with  respect  to  this  thesis  since  the  validity  of  the  tabled 
critical  values  depends  on  it.  This  assumption  is  examined  with 
the  MLE  estimates  of  a  1 ogi st i c< 1 ,4)  and  a  1  og i st i c ( 4 , 1  6)  .  In 
order  to  be  invariant  the  MLE  estimates  of  the  1 og i st i c ( 1 , 4) 
should  be  equal  to  the  MLE  estimates  of  the  1 ogi st i c (4 , 1 6)  after 
multiplication  by  a  factor  of  four.  These  estimates  were  ob¬ 
tained  on  separate  computer  runs  for  sample  sizes  of  5,  15  and 
25.  Hand  calculations  show  that  when  multiplied  by  a  factor  of 
four  the  1 og i st i c < 1 , 4)  MLEs  differed  from  those  of  the  logis¬ 
tic^, 16)  by  generally  0.000001.  This  small  difference  is  attri¬ 
buted  to  computer  round-off. 


SENSITIVITY  ANALYSIS  OF  PROGRAM 


A  condition  required  by  the  Secant  method  is  that  for  conve¬ 
rgence  the  initial  estimates  must  be  "good*.  Since  in  a  few 
cases  the  Secant  method  used  in  this  thesis  did  not  converge,  an 
examination  of  what  "good"  means  is  conducted.  This  is  done  by 
systematicly  changing  the  population  standard  deviation  used  to 
generate  the  logistic  deviates,  and  generating  2000  samples  for 
each  sample  size.  Each  time  the  Secant  method  used  in  this 


thesis  does  not  converge,  the  sample  mean  and  standard  deviation 
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are  printed  out.  By  comparing  results  it  becomes  apparent  that 
when  the  sample  is  tightly  grouped  about  the  population  mean,  the 
sample  standard  deviation  becomes  very  small.  When  the  sample 
standard  deviation  is  generally  less  than  25-30X  of  the  popula¬ 
tion  standard  deviation,  the  method  -fails  to  converge. 


PRESENTATION  OF  THE  POWER  STUDY 

A  comparison  o-f  the  relative  power  o-f  the  modi-fied  Kolmo- 
gorov-Sm i r nov ,  Anderson-Darling  and  Cramer-Von  Mises  goodness- 
o-f— f  i  t  tests  developed  in  this  thesis  is  made.  For  distribu¬ 
tions  with  non-symme  tr  i  cal  convex  patterns  the  power  o-f  the 
2  2 

A  and  U  tests  is  very  good.  For  distributions  with 

symmetric  convex  patterns,  the  power  is  much  lower.  The 

distributions  used  in  this  power  study  are  listed  in  Chapter 

III.  For  each  o-f  these  distributions  1000  samples,  -for  each 
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sample  size,  are  obtained.  For  each  sample  a  KS,  A  ,  and  W 
statistic  is  calculated  using  Eq  (4),  Eq  <6>  or  Eq  <8).  Each 
statistic  is  compared  to  the  appropriate  tabled  critical  value 
in  table  V,  VI  or  VII.  This  comparison  is  done  at  the  0.05 
significance  level.  For  each  sample  size,  the  number  of  times 
the  calculated  statistic  exceeds  the  tabled  critical  value,  an 
index  variable  is  incremented  by  one.  This  index  variable  is 
then  divided  by  1000  to  yield  the  percent  of  time  the  null 
hypothesis  (that  the  sample  came  from  a  logistic  distribution) 
is  rejected. 


When  the  sample  comes  from  a  uniform  or  an  exponential 


distribution  the  power  of  the  modified  KS,  A  and  W  tests 
is  very  good  (Tables  I IX  through  XIII).  The  shape  of  the  pdf 
of  these  distributions  is  decidedly  not  symme tr i cal  1 y  convex. 
This  undoubtedly  is  the  reason  the  tests  presented  in  this 
thesis  are  so  powerful  when  testing  samples  from  these  distri¬ 
butions. 

However,  when  the  sample  comes  from  a  distribution  with  a 
symmetric  convex  pattern  the  power  of  the  tests  presented  in 
this  thesis  is  much  lower.  This  is  due  primarily  to  the  diffi¬ 
culty  of  distinguishing  between  similar,  yet  different,  symme¬ 
trical  convex  patterns.  The  distributions  tested  in  this  cate¬ 
gory  are  the  Normal,  We i bu 1 1 ,  and  various  Gamma  distributions. 
In  the  case  of  the  Wei  bull  and  Normal  distributions,  as  well 

as  the  Gamma  with  higher  shape  parameter  values,  the  power  of 
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the  modified  KS,  A  and  W  tests  is  little  more  than  the 
significance  level.  Only  when  the  symmetric  convex  pattern  of 
the  Gamma  distribution  becomes  skewed  does  the  power  of  these 
tests  begin  to  noticeably  exceed  the  significance  level. 

The  modified  tests  presented  in  this  thesis  generally 
reject  the  null  hypothesis  of  a  logistic  distribution  when  the 
sample  comes  from  a  Gamma  distribution.  Yet,  they  have  only 
moderate  power  for  lower  shape  parameter  values,  when  the 
symmetric  convex  pattern  is  skewed.  For  higher  shape  parameter 
values  thier  rejection  power  is  only  slightly  above  the  signi¬ 
ficance  level.  This  shows  how  difficult  it  is  for  these  tests 
to  distinguish  between  other  symmetric  convex  patterns  and  the 
symmetric  convex  pattern  of  the  logistic  distribution. 


The  Normal  distribution  is  symmetrically  convex  and  very 
close  in  shape  to  that  o-f  the  Logistic  distribution.  All 
three  modified  tests  have  considerably  more  difficulty  rejec¬ 
ting  the  null  hypothesis  in  this  case.  Each  test,  generally, 
reports  a  different  power  for  each  sample  size.  The  power  is 
simetimes  slightly  above  the  significance  level  and  at  other 
times  slightly  below  it.  There  does  not  appear  to  be  any 
consistent  pattern  in  rejecting  the  null  hypothesis. 

When  applied  to  the  Logistic  distribution,  all  tests 
generally  failed  to  reject  the  null  hypothesis  that  the  sample 
came  from  a  logistic  distribution.  Yet,  in  this  case,  the 
rejection  percentage  is  not  always  equal  to  the  significance 
level  as  might  be  expected.  This  variability  is  pri mar i 1 y  a 
function  of  the  Monte  Carlo  method  and  the  1000  repetitions 
upon  which  the  study  is  based.  For  example,  the  expected 
number  of  times  the  power  of  these  tests  should  be  between  .04 
and  .06  based  on  1000  repetitions  is  approximately  three.  In 
actuality,  this  occured  four  times.  In  most  cases,  the  rejec¬ 
tion  percentage,  when  the  sample  comes  from  the  Logistic 
distribution,  is  between  .048  and  .052. 

In  summary,  all  three  tests  are  better  at  rejecting  the 

null  hypothesis  of  a  logistic  distribution  when  the  sample 

distribution  is  not  from  a  Normal  distribution,  than  when  it 

is  from  a  Normal  distribution.  When  the  sample  distribution 

o 

is  not  a  Normal  distribution,  the  W  test  is  slightly 
better  than  the  A^  or  KS  tests.  When  the  sample  is  from 


a  Normal  distribution  the  W  test  is  generally  better  than 
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Ml.  CONCLUSIONS  AND  RECOMMENDATIONS 


Cone  1  us i ons 

Based  on  results  obtained  in  this  thesis,  the  -following 
conclusions  are  noted: 


1.  The  Kol mogorov-Sm i rnov ,  Anderson-Darling  and  Cramer- 
von  Mises  critical  values  for  the  two  parameter  logistic 
distribution  are  valid. 

2.  The  power  compar i son  study  based  on  the  ten  different 
distributions  listed  in  Chapter  III  shows  that  for  non-symme- 
trical  convex  distributions  all  three  tests  exceed  the  claimed 
level  of  significance.  For  skewed  symmetrical  convex  distri¬ 
butions  all  three  tests  approximate  or  exceed  the  level  of 
significance  claimed.  For  symmetric  convex  distributions  all 
three  tests  very  closely  approximate  the  claimed  level  of 
significance.  This  indicates  a  goodness-of -f i t  test  for  a 
sample  from  a  symmetrical  convex  distribution  is  not  practi- 
cal.  The  W  test  is  generally  more  powerful  than  the 

A^  or  KS  tests. 

Recommendat i ons 

The  following  recommendations  are  suggested  for  further 
investigation: 


1.  Determine  if  another  method  for  determining  initial 


estimates  significantly  affects  the  critical  values  and  power 
presented  in  this  thesis. 

2.  Reduce  the  variability  in  the  significance  level  when 
the  null  hypothesis  of  a  logistic  distribution  is  true  by 
emp’oying  a  method  other  than  the  Monte  Carlo  method,  or 
increase  the  number  of  repetitions  used  in  the  power  study. 

3.  Employ  some  other  method  to  determine  the  critical 
values  for  the  logistic  distribution. 

4.  Using  Stephens  values  and  formuli  perform  a  power 
study  and  compare  those  results  to  the  results  of  this  thesis 
to  determine  which  method  is  more  powerful . 


5.  Develop  a  test  to  distinguish  between  symmetrical 


convex  distributions. 
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TABLE  M 


vvvsv; 


TABLE  I'll 


TABLE  I IX 


COMPARISON  OF  KOLMOGOROV- SMI RNOV ,  ANDERSON-DARLING 
AND  CRAMER-VON  MISES  STATISTICS  FOR 
SAMPLE  SIZE  OF  3,  fiMD  SIGNIFICANCE  LEVEL  OF  .03 


Un I  Form 
Exponan 1 1 a  1 
Waibull <3) 
G*innta<  3) 
Gamma(6> 
Gamma<9) 
Gamma ( 15) 
Gamma ( 30 ) 
Normal 


PERCENT  OF  TIME  NULL  HYPOTHESIS  REJECTED 


KS 

A2 

.208 

.119 

.257 

.119 

• 

.023 

.027 

• 

.069 

.109 

• 

.050 

.079 

• 

.040 

.069 

• 

.099 

.079 

• 

.059 

.069 

• 

.059 

.050 

m 

168 

178 

039 

069 

059 

069 

089 

059 

039 


Logi at i c 


051 


049 


049 


TABLE  IX 


COMPARISON  OF  KOLMOGOROV-SMI RNOV,  ANDERSON -DARLING 
AND  CRAMER-VON  MISES  STATISTICS  FOR 
SAMPLE  SIZE  OF  10,  AND  SIGNIFICANCE  LEVEL  OF  .03 


PERCENT  OF  TIME  NULL  HYPOTHESIS  REJECTED 


Un i form 


.287 


.198 


.2  67 


Exponent i a  I 


.535 


.495 


.564 


Wei  bull <3> 


.046 


.049 


.057 


Gamma <  3) 


.089 


.129 


.109 


Gamma<6) 


.050 


.059 


.040 


Gamma (  9) 


.079 


.069 


.069 


Gamma ( 15) 


.050 


.050 


.079 


GammaC  30 ) 


.047 


.050 


.053 


Normal 


.040 


.089 


Log i *t i c 


72 


TABLE  X 


COMPARISON  OF  KOLMOGOROV- SM I RNOV ,  ANDERSON-DARLING 
AND  CRAMER-VON  MISES  STATISTICS 
FOR  SAMPLE  SIZE  OF  IS,  AND  SIGNIFICANCE  LEVEL  OF  .OS 


PERCENT  OF  TIME  NULL  HYPOTHESIS  REJECTED 


KS 


Un  i  -for 
Exponan  v 
U*  i  bu  1  I  <  3) 
Gamma<3) 
Gamma  <  6) 
Gamma  <  9) 
Gamma ( 15) 
Gamma  ( 30  > 
Normal 


.317 

.633 

.049 

.129 

.119 

.079 

.079 

.049 

.050 


.277 

.703 

.051 

.218 

.129 

.089 

.069 

.040 

.020 


.337 

.782 

.059 

.158 

.139 

.089 

.069 

.069 

.030 


Log i *t  i  c 


.050 


052 


061 


TABLE  XI 


COMPARISON  OF  KOLMOGOROV-SMI RNOV ,  ANDERSON-DARLING 
AND  CRAMER-VON  MISES  STATISTICS 
FOR  SAMPLE  SIZE  OF  20,  AND  SIGNIFICANCE  LEVEL  OF  .03 


PERCENT  OF  TIME  NULL  HYPOTHESIS  REJECTED 


Un  i  -form 
Exponcn  t i a  I 
Weibul 1 <3> 
Gamma (  3) 
Gamma < 4> 
Gamma< 9) 
Gamma < 13) 
Gamma  <  30 ) 
Normal 
Loqi *t i c 


KS 

A2 

u2 

.443 

.528 

.528 

.812 

.851 

.881 

.058 

.039 

.059 

.0?? 

.178 

.149 

.099 

.129 

.149 

.049 

.129 

.109 

.059 

.089 

.089 

.049 

.049 

.049 

.118 

.049 

.109 

.041 

.048 

.050 

TABLE  XII 


COMPARISON  OF  KOLMOGORCW-SMI RNOW ,  ANDERSON-DARLING 
AND  CRAMER-VON  MISES  STATISTICS 
FOR  SAMPLE  SIZE  OF  2S,  AND  SIGNIFICANCE  LB/EL  OF  .09 


PERCENT  OF  TIME  NULL  HYPOTHESIS  REJECTED 


KS 

A2 

w2 

Un i form 

.545 

.584 

.634 

Exponent i *1 

.832 

.891 

.891 

Uleibull  <3> 

.058 

.049 

.078 

Gamn*<3> 

.099 

.208 

.139 

(3 amma<  6) 

.129 

.139 

.129 

3aiwna<9> 

.109 

.129 

.129 

Gamma ( 15) 

.089 

.139 

.109 

Gamma<30> 

.089 

.069 

.089 

Normal 

.040 

.020 

.040 

Log j  mt i c 

.050 

.039 

.039 

TABLE  XIII 


COMPARISON  OF  KOLMOGOROV-SMI RNOV ,  ANDERSON-DARLING 
AND  CRAMER-CON  MISES  STATISTICS 
FOR  SAMPLE  SIZE  OF  30,  SIGNIFICANCE  LEVEL  OF  .03 


PERCENT  OF  TIME  NULL  HYPOTHESIS  REJECTED 


Un i Form 
Exponent i al 
Ucibul 1(3) 
Gamma <  3) 
Gamma ( 6) 
Gamma ( 9) 
Gamma < 1 3) 
Gamma < 30) 
Normal 
Log i at i c 


KS 

1 - 

A2 

w2 

.604 

.673 

.624 

.931 

.980 

.980 

.048 

.066 

.079 

.218 

.386 

.317 

.109 

.248 

.198 

.099 

.119 

.109 

.069 

.109 

.089 

.119 

.079 

.107 

.040 

.050 

.059 

.050 

.050 

.051 

k.  • 


* 

L\\ 


PROGRAM  VALUES 

******************************************************* 

* 

*  DEFINITION  OF  VARIABLES  IN  MAIN  PROGRAM 

* 

»  PURPOSE:  GENERATE  A  SAMPLE  FROM  THE  LOGISTIC  DISTRIBUTION 

* 

* 

* 

* 

* 

* 

* 

*  MM= 

*  SS= 

*  M2= 

*  SD2= 

*  N= 

*  X< J)= 

*  2  <  J)  = 

*  PI  E= 

*  CONST= 

*  SEED= 

*  GGUBFS=  PSUEDORANDOM  NUMBER  GENERATOR  (IMSL) 

*  VSRTA=  SORT  ROUTINE  (IMSL) 

*  FX=  HYPOTHETICAL  SAMPLE  DISTRIBUTION 

*  SX=  EMPIRCIAL  SAMPLE  DISTRIBUTION 

* 

******************************************************* 

* 

REAL  KS,A2,W2,AA,WW,BB,SSQ,X<30) ,Z(30) ,FX(30) ,SX(30> 

REAL  XXKS< 5001 ) ,XXA2<5001 )  ,XXUJ2(5001  ) ,YY<5001 ) 

REAL  M2,SD2,MM,SS,SUM1  ,  SLff>12  ,  PI  E ,  CONST ,  DI FF  <  30  ) 

INTEGER  N, I ER, LOOP, REP 
DOUBLE  PRECISION  SEED 
EXTERNAL  GGUBFS ,  VSRTA 
COMMON  N, CONST, PIE, LOOP 

* 

***************  INITIALIZATION  ******************** 

* 

OPEN<  3 ,  FI  LE="VALOUT"  ,  STATUS="NELI"  , FORM=" UNFORMATTED"  > 

WRITEC3, 101 ) 

101  FORMAT < 2X , "  CRITICAL  VALUES  FOR  LOGI STI C< 1 00 , 625)  FOR  J0HN5  ") 

* 

SEED=453689621 .DO 
CONST=3.**0 .5 
PIE=3. 14159265358 
N=0 

9995  N=N+5 

I F(N . GT . 30 ) GOTO  9999 
WRITE<  3, 51 1 )N 

511  FORMAT <////2X , "  SAMPLE  SIZE=  ",I5) 

DO  5  1=1 ,N 
X<I)=0. 

5  CONTINUE 


OBTAIN  THE  MLE  ESTIMATES  OF  THE  LOCATION  AND  SCALE, 
DETERMINE  THE  HYPOTHETICAL  AND  EMPIRICAL  DISTRIBUTION 
OF  THE  SAMPLE,  AND  CALCULATE  THE  KOLMOGOROV-SMI RNOV , 
ANDERSON-DARLING  &  CRAMER-VON  MISES  STATISTICS  FOR  THE 
SAMPLE.  REPEATE  THIS  OPERATION  5000  TIMES  FOR  EACH  SAMPL 
SIZE  OF  5,10,15,20,25,  &  30 

SAMPLE  MEAN 

SAMPLE  STANDARD  DEVIATION 

ITERATIVE  EST  OF  MEAN  RETURNED  BY  MLE 

ITERATIVE  EST  OF  STANDARD  DEVIATION  RETURNED  BY  MLE 

#  IN  SAMPLE 

ARRAY  OF  LOGISTIC  DEVIATES 
ARRAY  OF  STANDARDIZED  DEVIATES 
ARITHMATI C  VALUE 
ARITHMATI C  VALUE 

INITIAL  VALUE  FOR  RANDOM  NUMBER  GENERATOR 


k.« 


ftli 


78 


r-r 


L: 


I 

I,’ V 


B 


I 


M 


* 


K '•• 

i  •  - 


* 

**************  START  MAJOR  00  LOOP  OF  5000  REPS  ******** 

* 

REP=5000 
DO  4  1=1 , REP 
XXKS< I >=0 . 

XXA2( I ) =0 . 

XXW2< I >=0 . 

4  CONTINUE 

* 

DO  10  LOOP=l , REP 

* 

**************  START  MINOR  DO  LOOPS  BASED  ON  N  *********** 
* 

* 

**************  CALCULATE  LOGISTIC  DEVIATES 
* 

RN=0  . 

AA=0  • 

COUNT =0 

2000  DO  20  J=1 ,N 

RN=GGUBFS<  SEED) 

AA=LOG( ( 1 . -RN)/RN) 

X< J)=100.0-<  <  <25.0*C0NST)*AA)/PIE) 

20  CONTINUE 

* 

************  INITIAL  ESTIMATES  USING  SAMPLE  MEAN  &  STD  DEV 
* 

SUM 1=0 . 

SUM 2=0 . 

DO  30  J=1 ,N 

SUM 1 =SUM 1 +X  <  J  > 

30  CONTINUE 

* 

MM=0. 

MM=SUM1/N 

* 

BB=0  . 

DO  40  J=1 ,N 

BB=<X( J) -MM) **2 . 

SUM2=SUM2+BB 
40  CONTINUE 

* 

SSQ=0 . 

SS=0  . 

SSQ=SUM2/<N-1 ) 

SS=SSQ**0 .5 

* 

****************  CALL  MLE  SUBROUTINE 
* 

CALL  MLE<X,MM,SS,M2,SD2, IER) 

50?  FORMAT </2X , '  REP=  ',15) 

IF< IER.EQ. 1 )THEN 
COUNT=COUNT+ 1 
WRITE( 3,50?) LOOP 


7? 


WRIT£<  3 , 520 ) 

520  FORMAT ( 2X , '  MLE  COULD  NOT  CONVERGE,  TRYING  NEW  SAMPLE  ' ) 

GOTO  2000 
ENDIF 

* 

COUNT  1=0 

IF< I ER . EQ . 3) THEN 
COUNT 1 =COUNT 1+1 
WRITE( 3 , 509) LOOP 
WRITE<  3 , 522) 

522  FORMAT <2X, '  MLE  EST  OF  S.D.  LE  0.1  ') 

GOTO  2000 
ENDIF 

* 

****************  CALCULATE  F<X)  AND  S<X> 


CALL  VSRTA<X,N) 

DO  88  K=1  ,N 
FX<K)=0 . 

2  <  K)=0 . 
CONTINUE 


DO  50  J=1  ,N 

2 <  J)  =  <  PI E*<X<  J) -M2) >/(SD2*C0NST> 
FX( J>  =  < 1 ./< 1 . +EXP< -2 <  J) ) >  ) 

50  CONTINUE 

* 

***************  CALCULATE  KS,  A<SQ)  &  W(SQ) 
* 

CALL  STATS <  DI FF , FX , A2 , W2 , KS> 

XXKS  <  LOOP  >  =KS 
XXA2  <  LOOP ) =A2 
XXW2<L00P)=W2 

* 

********************  end  of  major  loop 
* 

10  CONTINUE 
* 

CALL  VSRTA< XXKS, REP) 

CALL  VSRTA<XXA2 , REP ) 

CALL  VSRTA<XXW2 , REP) 


YY  <  REP+ 1 )  =  1 .0 
DO  90  K=1 , REP 

YY  <  K)  =  <  K- . 5) /REP 
CONTINUE 

CALL  ENDPT< REP, XXKS, YY, POINT) 
XXKSCREP+l )=POINT 

CALL  ENDPT < REP, XXA2,YY, POINT) 
XXA2<  REP+ 1 )=POINT 

CALL  ENDPT < REP, XXW2,YY, POINT) 


O  *  *  *  -  «  ■*  ,  1  .  • 


****************************************************** ******* 
************************************************************* 
************************************************************* 
************************************************************** 
* 

*  DEFINITION  OF  VARIABLES  IN  SUBROUTINE  MLE 

* 

*  PURPOSE:  TO  FIND  A  SOLUTION  TO  L'(X)=0  GIVEN  INITIAL 

*  APPROXIMATIONS  M2  L  SD2 

* 

*  MM=  SAMPLE  MEAN 

*  SS=  SAMPLE  STD  DEV 

*  Ml  =  OLD  ITERATIVE  EST  MEAN 

*  M2=  NEW  ITERATIVE  EST  MEAN 

*  M3=  CANDIDATE  EST  MEAN 

*  SD1=  OLD  ITERATIVE  EST  STD  DEV 

*  SD2=  NEW  ITERATIVE  EST  STD  DEV 

*  SD3=  CANDIDATE  EST  STD  DEV 

*  PI E=  ARITHMATI C  VALUE 

*  CONST=  ARITHMATIC  VALUE 

*  X(N)  =  ARRAY  OF  DEVIATES 

*  TOL=  TOLERANCE  FOR  5  SIGNIFICANT  DIGITS 

*  GK 1  )=  L' (Ml )  SD2  KNOWN 

*  Q<2>=  L' (M2)  SD2  KNOWN 

*  GK3>=  L'(SDl>  M2  KNOWN 

*  Q<4>=  L' < SD2)  M2  KNOWN 

*  AE1&2=  ABSOLUTE  ERRORS 

*  N=  SAMPLE  SIZE 

*  I ER=  CONVERGENCE  ERROR 

* 

************************************************************** 

* 

SUBROUTINE  MLc'(X  ,MM ,  SS  ,M2 ,  SD2 , 1  ER> 

REAL  MM , SS ,M1 ,M2,M3,SD1 , SD2 , SD3 , PI E ,X(N> ,TOL, CONST 
REAL  CC , DD , EE , GG , HH , Q< 4) ,SUM(6> ,AE1 ,AE2 
INTEGER  N ,  I  ER 
COMMON  N,  CONST,  PIE,  LOOP 

* 

**********  INITIAL  VALUES 
* 

TOL=0 .000005 
Ml  =0  . 

M2=MM 
M3=0  . 

SD3=0 . 

SD2=SS 
SDl=SD2+0 .5 

* 

**********  START  SECANT  MTHD 
* 

* 

DO  5  K=1 ,50 

* 

SUM< 1 ) =0 . 

SUM<  2>=0 . 


DO  10  1=1 ,N 

DD=<  <X<  I >-Ml >*PIE)/<SD2*C0NST> 

EE=< <X< I ) -M2) *PI E)/<  SD2*C0NST) 

SUM< 1 >=SUM< 1 >  +  <  EXP< -DD>/( 1 +EXP< -DD) > ) 
SUM<2)=SUM<2)+<EXP<-EE>/< 1+EXP<-EE> ) ) 

10  CONTINUE 
* 

***************  1ST  DERIVATIVE  VALUES  FOR  MEAN 
* 

cc=o . 

CC=P I E/ ( SD2*C0NST ) 

* 

Q< 1 )=0 . 

Q(2)=0. 

GK  1  >=CC*<N-<2.0*SUM< 1 ) ) ) 

GK  2)=CC*<N-<  2 . 0  *SUM<  2) ) > 

* 

M3=M2-(Q<2)*<  CM2-M1 )/(  GK  2) -GK 1 >  >  > ) 
AE1=ABS<M3— M2> 

* 

M1=M2 

M2=M3 

* 

SUM( 3)=0  . 

SUM<4)=0. 

SUM<  5)=0 . 

SUM<6)=0 . 

GG=0  . 

HH=0  . 

* 

DO  20  J=1  ,N 

GG=<  <X< J)-M2)*PIE)/<SD1*C0NST) 

HH=<  <X< J)-M2)*PIE)/<SD2*C0NST) 

SUM<  3)=SUM<  3) +GC 
SUM(4)=SUM(4)+HH 

SUM<  5)=SUM<  5)  +  <  <  GG*EXP< -GG> )/( 1  .  +  EXPC-GG) ) ) 
SUM<  6)=SLffi( 6)  +  <  <  HH*EXP( -HH) )/< 1  .+EXP(-HH> ) ) 
20  CONTINUE 
* 

***************  1ST  DERIVATIVE  VALUES  FOR  STD  DEV 
* 

Q<  3)=0  . 

Q(4)=0  . 

Q(3)  =  (l  ./SD1  )  *(  SUM<  3)  -  <  2 . 0*SLSi(  5)  )  -N) 

Q(4)=< 1 ./SD2)*<SUM(4)-<2.0*SUM<6) )-N> 

* 

SD3=SD2-(Q(4) *<  CSD2-SD1 )/( Q( 4) -Q< 3) ) ) ) 
AE2=ABS<  SD3-SD2 ) 


I  F( SD3 . LE . 0 . 1 )THEN 
I  ER=3 


* 

* 


in  * 


* 


# 


1000 

* 

1200 


GOTO  1200 
ENDIF 

SD1=SD2 

SD2=SD3 

I F< AE1 .LT.TOL. AND. AE2.LT.T0L> GOTO  1000 
CONTINUE 

I  F<  K .  GE  .  50  .  AND .  AE1  .  GT  .TOL .  OR .  AE2 .  GT  .TODTHEN 
I  ER=  1 
GOTO  1200 
ENDIF 
I  ER=0 

RETURN 


*  N) 


************************************************************ 

************************************************************* 

************************************************************* 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


DEFINITION  OF  VARIABLES  IN  SUBROUTINE  STATS 

PURPOSE:  TO  CALCULATE  KOLMOGOROV- SM I RNOV ,  ANDERSON- 
DARLING,  6c  CRAMER-VON  MISES  STATISTICS  FOR  A  GIVEN  SAMPLE 

N=  #  IN  SAMPLE 

FX=  HYPOTHETICAL  DISB  OF  SAMPLE 

SX=  EMPIRICAL  DISB  OF  SAMPLE 

KS=  KOLMOGOROV-SMIRNOV  STATISTIC 
A 2=  ANDERSON-DARL I NG  ST AT I  ST I C 

W2=  CRAMER-VON  MISES  STATISTIC 


************************************************************* 


* 

SUBROUT I NE  STATS <  D I FF , FX , A2 , W2 , KS  > 
REAL  A2,KS,W2,FX<N> ,DIFF<N> , ONE, TWO 
INTEGER  N 

COMMON  N, CONST, PIE, LOOP 

* 

**********  CALCULATE  KS  STATISTIC 
* 

DO  2  1  =  1  ,N 
DIFF< I >=0 . 

CONTINUE 


DO  10  1=1 , N 
0NE=0 . 

TWO=0 . 

Q=  I  *  1 

QQ=<  Q- 1 )/N 
QQQ=Q/N 

ONE=ABS(FX< I ) -QQ) 

TWO=ABS ( FX  < I ) -QQQ ) 

I  F<  ONE .  GT  .  TWO) THEN 
DIFF< I )=ONE 

ELSE 

DIFF( I >  =TWO 
END  IF 

10  CONTINUE 

* 

KS=0  . 

DO  20  1=1 , N 

IF<DIFF( I ) . GT . KS ) KS=D I FF ( I > 
20  CONTINUE 


**********  CALCULATE  A(SQ)  STATISTIC 

* 


30 


1 


SUM=0  . 

DO  30  1  =  1  ,N 

SUM=SUM+  C  (  2.  *n  -1  .  >  *(.  (LOG(  FX<  I  )  )  >  +  <  LOG(  1  .  -FX(N+  1  - 1  )  )  )  > 
CONTINUE 
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AD-A138  098 

UNCLASSIFIED 


MODIFIED  KOLMOGOROV-SMIRNOV  ANDERSON-DARLING  AND  2/2 

CRAMER-VON  MISES  TESTS  F  <U>  AIR  FORCE  INST  OF  TECH 
WRIGHT-PATTERSON  AFB  OH  SCHOOL  OF  ENGI.  .  J  D  VODER 
16  DEC  81  RFIT/GOR/ENC/82D-7  F/G  12/1  NL 


MICROCOPY  RESOLUTION  TEST  CHART 

NATIONAL  BUREAU  OF  STANDARDS -1963 -A 


A  2=0 . 

A2=< -N) -<  SUM/N) 


#*#****#*  CALCULSTE  UKSQ>  STATISTIC 
* 

SUM=0. 

DO  40  1=1 ,N 

SUM=SUM+<FX<I >-<<2.*I >-1 . )/< 2 . *N>  > **2 . 
40  CONTINUE 
* 

U2=0. 

UJ2=<  1  ./<  12.*N>>+SUM 

* 

RETURN 

END 


************************************************************** 

************************************************************** 

* 

*  SUBROUTINE  ENDPTs  PURPOSE;  TO  DETERMINE  THE  LAST  POINT 

*  IN  A  GIVEN  SERIES  OF  STATISTICS 

* 

*  REP=  #  OF  POINTS  IN  THE  SERIES 

*  M=  SLOPE 

*  B=  INTERCEPT 

* 

*************************************************************** 

* 

SUBROUTINE  ENDPT< REP , XX ,YY , POINT) 

REAL  XX< REP) ,YY< REP) , POINT, M,B 
INTEGER  REP 

* 

M=0. 

M=< YY<  REP) -YY  <  REP-1 ) )/<XX< REP) -XX< REP-1 ) ) 

B=0 . 

B=YY< REP- 1 >-<M*XX< REP-1 >> 

POINT=< 1 . -B)/M 

* 

RETURN 

END 

****************************************************************** 

***************************************************************** 


* 

* 

* 

* 

w 

SUBROUTINE 

CV:  PURPOSE; 

TO  DETERMINE  THE  80 , 85, 90 , 95, 99TH 
PERCENTILES  OF  THE  DISTRIBUTION  OF 
THE  GIVBEN  STATISTIC 

* 

REP= 

#  OF  POINTS 

IN  SERIES 

* 

M= 

SLOPE 

* 

B=* 

INTERCEPT 

* 

* 

PER* 

PERCENTILE 

************************************************************* 

* 

SUBROUTINE  CV< REP , XX , YY > 

REAL  PER,YY<REP+1 ) ,XX<REP+1> ,M,B,P80 ,P85,P90 ,P95,P99 
INTEGER  REP 

* 

P80=0 . 

P85=0. 

P90=0 . 

P95=0. 

DO  10  J=80 , 95 , 5 
DO  20  K=2,REP 

I F<YY<  K> . GE . <  J/ 100.)) THEN 
M=0  . 

B=0 . 

PER=0 . 

M»(YY<K)-YY(K-i ) >/<XX< K> -XX< K-l ) ) 

B=YY ( K-l ) -<M*XX<  K-l ) ) 

PER“<  <  J/l 00 . ) -B)/M 
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CK  h> 


'  P  'P'W 


GOTO  666 
ENDIF 

0  CONTINUE 

66  I F  <  J . EQ . 80  >  P80=PER 

IF<J.EQ.85)P85=PER 
I F<  J . EQ . 90  >  P90=PER 
IF< J.EQ.95)P95=PER 
10  CONTINUE 

* 

P99=0 . 

DO  30  K=2f REP 

I F< YY<K> .GE.O .99>THEN 
M=0  . 

B=0. 

H=  <  YY  <  K ) - YY  <  K - 1 ) >/<XX< K > -XX< K- 1 > > 

B=YY ( K-l )-<M*XX(K-l ) ) 

PER=<0 .99-B)/M 
GOTO  777 
ENDIF 

30  CONTINUE 
* 

777  P99*=PER 
* 

WRITE<  3 , 800 ) 

800  FORMAT < ///2X , '  80,85,90,95,99  PERCENTILES 

WRITE <  3 , 802) P80 , P85 , P90 , P95 , P99 
802  FORMAT  <  2X , 5<  FI  0 . 7 , 2X) ) 

* 

RETURN 

END 
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.  • 

•  *■  *  *  - 


PROGRAM  POWER 

REAL  KS,A2,W2,AA,WW,BB,SSQ,X<30> ,Z<30> , FX< 30 > , Y< 30 ) 
REAL  M2 , SD2 ,MM , SS , SUM1 , SUM2 ,PI E , CONST , DI FF< 30 > 

REAL  KS95<6> ,A295<6> ,W295<6) 

INTEGER  N,IER, LOOP, REP 
DOUBLE  PRECISION  SEED 
EXTERNAL  GGUBFS , VSRTA , GGEXN , GGNQF 
COMMON  N, CONST, PIE, LOOP 

* 

***************  INITIALIZATION  ****■#****«*•**###*■*** 

* 

OPEN <  3 , F I LE= ' NORMOUT "  , STATUS="NEW" , FORM* "UNFORMATTED' > 
URITE(3, 101 ) 

101  FORMAT < 2X , "  CRITICAL  VALUES  FOR  NORMAL  "> 


SEED-453689421  .DO 
CONST-3. **0.5 
PIE-3. 14159245358 
N=0 

9995  N-N+5 

IF<N.6T .30) GOTO  9999 
WRITE(3,51 1 )N 

511  FORMAT <////2X,"  SAMPLE  SIZE*  ",I5> 

DO  5  1*1 ,N 

X<I)*0. 

Y< I >=0 . 

Z<I)*0. 

5  CONTINUE 
DO  6  1*1,6 

KS95< I )*0 . 

A295< I )*0 . 

W295( I )— 0 . 

6  CONTINUE 


* 

2000 


REP-1000 

DO  10  LOOP* 1, REP 

DO  20  J-l ,N 

Y< I )*GGNQF<  SEED) 
X<I>*<Y<I) *2 . )  + 1 . 
CONTINUE 


INITIAL  ESTIMATES  USING  SAMPLE  MEAN  &  STD  DEL* 


SUM 1—0 . 

SUM 2=0 . 

DO  30  J=1 ,N 

SUM1~SUM1+X< J) 
CONTINUE 


MM-0. 

MM-SUM1/N 


\  V  /. 


BB— 0 


DO  40  J«1 ,N 

BB=<X( J)-MM>**2. 
SUM2=SUM2+BB 
CONTINUE 

SSQ=0. 

SS=0. 

SSQ=SUM2/<N-1 ) 
SS=SSQ**0 . 5 


****************  CALL  MLE  SUBROUTINE 

* 

CALL  MLE(X,MM , SS ,M2 , SD2 , IER> 
I F( I ER . EQ . 1 ) GOTO  2000 
I F< I ER . EQ . 3) GOTO  2000 


****************  CALCULATE  F<X)  AND  S<X> 


CALL  VSRTA<X,N> 

DO  85  J=1 ,N 

Z< J)=<PIE*<X< J)-M2>  >/< SD2*C0NST> 
FX<J>=<1 ./<1 .+EXP<-Z< J> ) >  > 
CONTINUE 


*************** 

* 


CALCULATE 


A<SQ>  &  W<SQ> 


CALL  STATS<DIFF,FX,A2,W2f KS) 
IF<N.EQ.5.AND.KS.GT.0.309)KS95<1)=KS95<1>+1 
I F<N . EQ . 5 .AND .A2.GT.0 . 420  > A295< 1 >=A295< 1  )  + 1 
IF<N. EQ . 5. AND.U2.GT. 0 .096)W295< 1 >=W295( 1 )+ 1 
I F<N . EQ . 1 0 .AND . KS . GT . 0 . 230  >  KS95<  2)=KS95<  2)  +  1 
IF(N.EQ. 10 .AND.A2.GT.0 . 640>A295(2)=A295<2>+1 
IFCN.EQ . 1 0 .AND .W2.GT .0 .095>W295<2>=W295<  2>  + 1 
IF<N.EQ. 15.AND.KS.GT .0.191 ) KS95< 3)=KS95(3) + 1 
IF<N.EQ. 15.AND.A2.GT.0 . 665) A295( 3>=S295< 3) ♦ 1 
IF(N.EQ. 15.AND .W2.GT . 0 . 098>W295<  3)=W295<3>+1 
IF(N.EQ.20  .AND.KS.GT.0 . 170>KS95<4)=KS95<4)+1 
IF(N.EQ.20 .AND.A2.GT.0 .663>A295<  4>=A295<4>  +  1 
I F<N . EQ . 20 . AND . W2 . GT . 0 . 097) W295<  4>»W295<  4)  + 1 
I F<N. EQ . 25 .AND . KS . GT . 0 . 655) KS95<  5>=KS95<  5)  + 1 
IF(N. EQ . 25 .AND.A2.GT . 0 . 663) A295<5)=A295<5>  + 1 
IF<N.EQ.25.AND.W2.GT.0.098)W295<5)=W295<5)+1 
I F<N . EQ . 30 .AND . KS . GT . 0 . 1 41 >  KS95<  4)=KS95<  4)  + 1 
IF(N.EQ.30 .AND.A2.GT.0 . 649)A295< 4)=A295< 6) + 1 
IF<N.EQ.30  .AND.U2.GT.0  .098)W295< 6)=UJ295<  6)  + 1 


********************  END  OF  MAJOR  LOOP 

* 

10  CONTINUE 


DO  89  K-l ,6 


1 


j 

888 

A*KS95<K>/1000. 

B-A295< K>/1000. 

C*S295<K>/1000. 

W=K*5 

WRITE(3,51 1  >U 

WRITE< 3 , 888) A , B , C 

FORMAT  <  //2X ,  '  KS*/=  ',F6.3,'  A2/.=  ',F6.3,'  W2X=  ',F6. 3> 

i 

89 

CONTINUE 

9999 

GOTO  9995 

ENDFILE<  3) 

CLOSE (3) 

************************************************************* 
SUBROUTINE  MLE<X,MM,SS,M2,SD2, IER) 

REAL  MM,SS,M1 ,M2,M3,SD1 , SD2 ,SD3 , PI E ,X(N> ,TOL, CONST 
REAL  CC , DD , EE , 66 , HH , Q ( 4  > ,SUM<6> ,AE1 ,AE2 
INTE6ER  N, IER 
COMMON  N, CONST, PIE, LOOP 

* 

**********  INITIAL  VALUES 
* 

TOL=0. 000005 
Ml—0 . 

M2=MM 

M3*0. 

SD3— 0 ■ 

SD2=SS 

SDl=SD2+0.5 

* 

**********  START  SECANT  MTHD 
* 

* 

DO  5  K= 1 ,50 

* 

SUM< 1 )=0 . 

SUM<  2)=0 . 

DD=0. 

EE=0  . 

* 

DO  10  1=1 ,N 

DD=<  <X< I )-Ml ) *PI E)/< SD2*C0NST> 

EE=<  <X< I )-M2)*PIE)/<SD2*C0NST> 

SUM< 1 )=SUM< 1 )  +  < EXP<— DD>/< 1 +EXP( -DD>  > ) 

SUM<  2>*SUM<  2)  +  <  EXP< -EE)/< 1 +EXP< -EE>  > ) 

10  CONTINUE 
* 

***************  1ST  DERIVATIVE  FUNCTIONAL  VALUES  FOR  MEAN 
* 

cc=o . 

CC=P I E/ <  SD2*C0NST ) 

* 

QC1)=0. 

Q<2)=0. 

Q< 1 )=CC*<N-<  2 . 0*SUM< 1 ) ) ) 

Q<2>=CC*<N-<2.0*SUM<2>  >> 

* 

M3=M2-<  Q<  2>  *< (M2-M1 )/( Q< 2) -Q< 1 ) ) ) ) 

AE1=ABS<M3-M2> 

* 

M1*M2 

M2-M3 

* 

SUM<3)-0. 

SUM<  4)=0 . 

SUM<5)»0. 

SUM<6>=0. 


I 


DO  20  J-l ,N 

GG=<  <X< J>-M2>*PIE>/<SD1*C0NST> 

HI+«<<X<  J)-M2>*PIE)/<SD2*C0NST> 

SUM<  3)=SUM<  3>  +  GG 
SUM<  4)=SUM<  4) +HH 

SUM<  5>=SUM<  5>  +  <  < GG*EXP< -GG) )/<  1 .+EXP<-GG)>) 

SUM<  4>=SUM( 6>  +  (  < HH*EXP< -HH>  >/< 1 . +EXPC -HH> > > 

20  CONTINUE 
* 

***************  1ST  DERIVATIVE  FUNCTIONAL  VALUES  FOR  STD  DEV 
* 

Q<  3>— 0  . 

GK  4>=0 . 

Q<3)=<1 ./SD1 )*<SUM<3)-<2.0*SUM(5))-N> 

Q<4>=<1  ./SD2)  *<SUM<  4) -<  2 . 0*SUM<  <S>  )-N) 

* 

SD3=SD2-<  Q<  4) *<  <  SD2-SD1 )/<  Q<  4) -Q<  3>  > ) ) 

AE2=ABS <  SD3-SD2 ) 


I F  <  SD3 . LE . 0 . 1 ) THEN 
IER=3 
GOTO  1200 
END  IF 

SD1=SD2 

SD2=SD3 

I F<AE1 .LT.T0L.AND.AE2.LT.T0L  >GOTO  1000 
CONTINUE 

I  F<  K .  GE .  50  .AND . AE1  .  GT  .TOL .  OR . AE2 .  GT  .TODTHEN 
IER=1 
GOTO  1200 
ENDIF 
I  ER=0 


1000 


1 200  RETURN 


•  -  *  —  *  _  4  _  •  _  i 


************************************************************ 

************************************************************* 

* 

SUBROUTINE  STATS< DI FF , FX , A2 ,W2 , KS> 

REAL  A2,KS,U2,FX<N> ,DIFF<N> , ONE, TWO 
INTEGER  N 

COMMON  N, CONST, PIE, LOOP 

* 

**********  CALCULATE  KS  STATISTIC 
* 

DO  2  1=1 ,N 
DIFF< I )=0 . 

2  CONTINUE 

* 

DO  10  1=1 ,N 
0NE=0 . 

TWO=0. 

0=0. 

QQ=0. 

QQQ=0 . 

0=1*1 

QQ=<  Q-l )/N 
QQQ=Q/N 

ONE=ABS<FX< I )-QQ) 

TUO=ABS<  FX< I ) -QQQ) 

I F < ONE . 6T . TWO ) THEN 
DI FF< I >=ONE 

ELSE 

DI FF  < I )=TWO 
END  IF 

10  CONTINUE 

* 

KS=0. 

DO  20  1=1 ,N 

IF<DIFF< I ) . GT . KS) KS=DI FF ( I ) 

20  CONTINUE 

* 

**********  CALCULATE  A<SQ>  STATISTIC 
* 

SUM=0. 

DO  30  1=1 ,N 

SUM=SUM+< <2.*I )-l . )*<  LOG(FX< I ) ) )+<LOG< 1 .-FX(N+1-I ) ) ) ) 
30  CONTINUE 

* 

A 2=0  . 

A2=  <  -N  )  -  <  SUM/N  ) 

* 

*********  CALCULSTE  W<SQ)  STATISTIC 
* 

SUM=0. 

DO  40  1  =  1  ,N 

SUM=SUM+<FX< I )-< ( 2 . *1 ) -1 . >/<  2 . *N) ) #*2 . 

40  CONTINUE 

* 

W2=0. 


9 


U2=< 1 ./< 1 2 . *N> ) +SUM 


RETURN 
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Trie  •  ’  .0.:  of  maximum  likelihood  is  used  to  det>  Mine  invariant 

cat.  i  , .  t  .  of  the  uni. uown  location  and  scale  par  i  ters  of  a 
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of  tie  partial  derivatives.  Using  these  estima1  a  modified 

i.o  1  o  .irov-ami  rnov ,  Anderson-Darling  and  Cramer  on  Mises  statistics 

are  ■  .  i culated  for  a  given  sample.  This  proced  <  is  repeated 

bo  a  times  for  sample  sizes  of  n=  5(5)30.  The  <th,  85th,  90th, 

95tn  and  99tn  percentiles  of  the  distribuition  <  f  each  statistic, 
for  caci  sample  size,  is  then  calculated.  Thest  values  are  then 
i¥sud  t,o  generate  tables  of  critical  values  for  t  ,e  Logistic  distri¬ 
bution  with  unknown  location  and  scale  parameter..  A  power  com¬ 
parison  oetween  tne  three  tests  is  performed’ usi  ig  samples  from 
various  distributions. 

The  decant  method  retires  "good”  initial  esti  i. -t.es  of  the  para¬ 
meters  in  order  to  converge.  This  thesis  uses  ■  ie  sample  mean 
and  standard  deviation  as  initial  estimates.  Ii  four  of  the  total 
30,vf)0  samples  used,  these  initial  estimates  die  not  allow  con- 
vergen  e.  While  discarding  these  samples  biases  the  theoretical 
result  ,  it  was  determined  that  discarding  thest  samples  would 
not  biases  trie  numerical  results.  This  does  ho  -ver  place  a 
constraint  on  using  the  decant  method  with  resp<  it  to  obtaining 
maximum  likelihood  estimates  of  the  parameters.  The  power  of 
these  tests  Cor  non-symmetrically  convex  distrii  itions  is  very 
good.  However,  for  symmetrically  convex  distrii  itions,  the 
power  ranges  from  moderate  to  only  slightly  more  than  the 
signi f Loanee  level. 
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